A Holistic Approach to the Evaluation of Data Warehouse Maintenance Policies
نویسندگان
چکیده
The research community is addressing a number of issues in response to increased reliance of organisations on data warehousing. Most work addresses individual aspects related to incremental view maintenance, propagation algorithms, consistency requirements, performance of OLAP queries etc. There remains a need to consolidate relevant results into a cohesive framework for data warehouse maintenance. Although data propagation policies, source database characteristics, and user requirements have been addressed individually, their co-dependencies and relationships have not been explored. In this paper, we present a comprehensive, cost-based framework for evaluating data propagation policies against data warehouse requirements and source database characteristics. We formalize data warehouse specification along the dimensions of freshness (or staleness), response time, storage, and computation cost, and classify source databases according to their data propagation capabilities. A detailed cost model is presented for a representative set of policies. A prototype implementation has allowed an exploration of the various trade-offs. The results presented in this paper are for a single source, but the approach and the framework are extensible. Current work is addressing a broader class of sources and a more detailed data warehouse specification that includes multiple sources.
منابع مشابه
افزایش سرعت نگهداری افزایشی دید با استفاده از الگوریتم فاخته
Data warehouse is a repository of integrated data that is collected from various sources. Data warehouse has a capability of maintaining data from various sources in its view form. So, the view should be maintained and updated during changes of sources. Since the increase in updates may cause costly overhead, it is necessary to update views with high accuracy. Optimal Delta Evaluation method is...
متن کاملA Systematic Approach to Selecting Maintenance Policies in a Data Warehouse Environment
Most work on data warehousing addresses aspects related to the internal operation of a data warehouse server, such as selection of views to materialise, maintenance of aggregate views and performance of OLAP queries. Issues related to data warehouse maintenance, i.e. how changes to autonomous sources should be detected and propagated to a warehouse, have been addressed in a fragmented manner. A...
متن کاملChange Detection and Maintenance of an XML Web Warehouse
The World Wide Web contains a huge and increasing volume of information. The web warehouse is an efficient and effective means to facilitate utilization of information on the Web, not only to individual users but also to business organizations, especially for decision-making purposes. On the other hand, XML has recently become the new standard for representation and exchange of data on the Web....
متن کاملارزیابی سیستم اطلاعات انبار دارویی مراکز آموزشی درمانی شهر تهران
Introduction: Pharmaceutical Warehouse Information Systems is the software that provides drug preparation and maintenance operations from the order stage to delivery stage. To evaluate this system, different aspects of the system should be taken into consideration. In this study, the features and functionality of the system from the managers’ and users’ perspectives were examined. Methods: Thi...
متن کاملBenchmarking of Data Warehouse Maintenance Policies HS-IDA-MD-00-001
Many maintenance policies have been proposed for refreshing a warehouse. The difficulties of selecting an appropriate maintenance policy for a specific scenario with specific source characteristics, user requirements etc. has triggered researcher to develop algorithms and cost-models for predicting cost associated with a policy and a scenario. In this dissertation, we develop a benchmarking too...
متن کامل